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METHOD AND SYSTEM FOR TRANSMITTING MESSAGES ON TELECOMMUNICATIONS 
NETWORK AND RELATED SENDER TERMINAL 

CROSS REFERENCE TO RELATED APPLICATIONS 
This application is the US national phase of PCT 
s application PCT/EP2003/008604 , filed 4 August 2003, published 4 
March 2004 as WO 2004/019583, and claiming the priority of 
Italian patent application TO2002A000724 itself filed 14 August 
2002, whose entire disclosures are herewith incorporated by 
reference . 

10 FIELD OF THE INVENTION 

The present invention relates to the transmission of 
messages on telecommunication networks. 

BACKGROUND OF THE INVENTION 
The introduction of new generation mobile terminals , 
is for instance according to the UMTS standard (Universal Mobile 

Telecommunications System) or the GSM/ GPRS standard (acronyms for 
Global System for Mobile communications and General Packet Radio 
Service) has enabled the transmission and presentation on 
terminal of messages with multimedia content comprising different 
20 elements, such as text, sounds and images, also in motion, the 
messages are currently indicated as MMS, acronym for Multimedia 
Messaging System. 

The capability of transmitting the messages gives rise 
to different kinds of problems. 
25 In the first place, it is necessary to ensure that the 

messages can be constructed with relative ease by using an 
apparatus, like a mobile telephone, which, due to the reduced 
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size and processing capacity, is not ideally suited for 
generating messages with complex content. 

In the second place, it is desirable for terminals with 
the ability to transmit and receive MMS messages to be able to 
s coexist and interact with old generation terminals such as mobile 
terminals operating according to the GSM standard, able to 
generate only text messages of the type currently called SMS, 
acronym for Short Message Service. It is reasonable to think 
that the two technologies are destined to coexist for a fairly 
10 long time before all currently circulating terminals are 
replaced. 

OBJECT OF THE INVENTION 
The object of the present invention is to favor the 
coexistence and the interaction between terminals with the 
15 ability of transmitting text messages like SMS message and 
terminals able to receive MMS messages. 

SUMMARY OF THE INVENTION 
According to the present invention, this object is 
achieved thanks to a method with the characteristics specifically 
20 set out below. The invention also includes the related system as 
well as the corresponding sender terminal . 

In essence, the solution according to the invention 
allows old generation terminals - able to send SMS text messages 
- to induce the generation of messages with multimedia content, 
25 destined to MMS terminals. 

In the currently preferred embodiment, the solution 
according to the invention allows to provide a service that 
automatically transforms a pure text message into a multimedia 
message, hence into a "richer" message than the starting message, 
30 constituted by the pure text. 
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In the currently preferred embodiment, the solution 
according to the invention provides for using the system for the 
automatic automation of three-dimensional characters based on 
text or natural audio produced by the same Applicant and 
s identified by the registered trademark JoeXpress®. 

In this regard it is useful to consult the documents 
EP-A-0 991 023 (US 6,532,011), EP-A-0 993 197 (6,665,643) and 
WO-A-01/75805 (7,123,262). The system in question is able to 
transform a text or a recorded voice into the movements of a 

10 character who enunciates the processed sentences . These 

movements also include movements that are not linked with the 
spoken word, with facial expressions and body motions. The 
system is also able to handle other elements such as the 
personalization of the character's appearance (for example, the 

is color of the hair, of the eyes, the way it is dressed, etc.), the 
place where the character is positioned, the movement of the 
viewing point, the background music. All concurs in the 
construction of a video clip from a restricted number of input 
parameters provided. 

20 In this way, the solution according to the invention 

allows, for instance, to generate animations destined to MMS 
terminals on the basis of the text contained in a starting SMS 
message. In this case, the result is an MMS message comprising 
different parts, such as the scene description part (in 

25 "Synchronized Multimedia Integration Language" or SMIL) and the 
parts containing the multimedia objects to be inserted in the 
message, among which are automatically generated animations. 

The first generation of MMS terminals is subject to 
fairly stringent constraints on message content: in particular, 

30 video is not supported and the maximum size of the messages is 30 
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kBytes . A preferred embodiment of the solution according to the 
invention therefore allows to incorporate in the generated MMS 
message an animation with small size. In particular, the video 
is transformed into an image according to the GIF standard 
s (acronym for Graphics Interchange Format) subjected to animation 
using a rather low animation sampling rate, i.e. around one Hz. 

Moreover, the original text is subdivided among the 
various frames of the sequence. By doing so, with animations 
having, for example, sizes in the order of 100x80 pixels (the 

10 dimensions of the display units of currently marketed MMS 
terminals) one can generate messages containing animations 
lasting about 15 second, with complex models and scenarios, or 
longer in the case of simpler models, which allow a higher 
compression ratio within the animated GIF image. 

is If the total size of the message is limited (for 

instance, to 30 kBytes) making it problematic to transmit both 
video and audio, it is possible to cause the terminal, during the 
viewing of the animated GIF image, to reproduce, instead of a 
voice message, a melody inserted in the message: this type of 

20 sound ("ringer") is able to be contained in a very small number 
of bytes. 

In the presence of less strict constraints on the size 
of the message, the solution according to the invention allows to 
transmit, instead of text inside the frames or even in parallel 
25 therewith, the audio associated with the animation, generated for 
instance by a voice synthesizer. 

In this scenario, it is possible automatically to 
generate an MMS message even from natural audio, in which case 
the animation is guided by the result of the process carried out 
30 by a phonetic recognizer. Voice synthesizers and phonetic 
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recognizers able to carry out the functions described above are 
currently available in the art. 

In addition to animation, the MMS message can 
advantageously contemplate a part destined to contain more text, 
s melodies and images, useful for inserting, for instance, 
so-called "logos" and/or advertising slogans. 
BRIEF DESCRIPTION OF DRAWINGS 

The invention shall now be described purely by way of 
non limiting example with reference to the accompanying drawings , 
10 in which: 

FIG. 1 shows, at functional architecture levels, the 
structure of a system able to operate according to the invention, 

FIG. 2 is a flow chart illustrating the steps for 
transmitting a message according to the invention, and 
is FIGS. 3A and 3B show two contiguous parts of a 

functional block diagram illustrating a possible form of 
arrangement of the system according to the invention. 

BEST MODE FOR CARRYING OUT THE INVENTION 
The description provided herein refers to the 
20 application scenario which, at least at present is the most 

attractive one for the possible use of the invention, i.e. the 
conversion of text messages generated as SMS messages in a GSM 
mobile terminal into MMS messages destined to be transmitted on a 
network operating according to the UMTS standard. 
25 In any case, the solution according to the invention is 

also applicable to text messages generated differently, for 
instance in the form of email messages, and it can be used to 
transmit MMS messages on any type of network such as to support 
such a transmission, hence without limitation to UMTS networks. 
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In the diagram of FIG. 1, the numeric reference 10 
globally indicates a module having the function of MMS 
relay/server and comprising for this purpose a sub-module with 
relay function, indicated as 101, and a sub-module with server 
s function, indicated as 102, mutually connected through an 

interface indicated as 103. Naturally, the sub-modules 102 and 
103 can also be mutually integrated. 

The numeric reference 11 instead indicates a database 
of the users of an MMS service. This is substantially a database 
10 where, for each user to whom the MMS service is made available, 
the telephone number (or an equivalent indication) and the 
information about the terminal type employed by the user in 
question are recorded. 

The database 11 is connected to the module 10 through 
is an interface 111. 

The numeric references 12 and 13 indicate two users 
connected in a network to the module 10 (this can typically take 
place through an UMTS network) so as to be able to receive MMS 
messages . 

20 The user indicated as 12 is a user directly included in 

the network whereto the module 10 is attached. The related 
connection therefore is of the direct type, through an interface 
indicated as 121. 

The user indicated as 13, instead, is a user nominally 

25 attached to another mobile network. 

In this case, the connection to the module 10 is not 
direct but is achieved through an additional module 10 ' 
substantially similar to the module 10, by means of corresponding 
interfaces indicated as 131a and 131b. 
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The distinct representation of the user 12 and of the 
user 13 is destined to highlight the possibility of applying the 
solution according to the invention also in a context in which 
multiple telecommunication networks mutually co-operate in a 
s general internetworking or roaming scenario. 

The reference 14 indicates a server, such as an 
electronic mail server, connected to the module 10 through a 
respective interface 141 in order to be able to operate as a 
recipient of MMS messages. 
10 Lastly, the reference 15 indicates the system for 

billing the rendering of the MMS message services, connected to 
the module 10 through a respective interface 151. 

The system architecture and the various constitutive 
elements described heretofore correspond to solutions to be 
is considered wholly known in the art. These solutions are already 
able to be used for sending MMS messages within 

telecommunications networks (such new generation mobile networks 
operating according to the UMTS standard) . This fact makes it 
superfluous to provide herein a more detailed description of the 

20 architecture and of the elements in question. 

An important characteristic of the solution according 
to the invention is given by the fact that to the module 10 it is 
associated, preferably through a respective interface 161, a 
module or sub-system 16 able to convert text-only messages, such 

25 as SMS messages coming from an SMS message management center 17 
(usually called with the acronym SMSC) into messages with 
multimedia content. After possible further processing in module 
10, the messages can be broadcast by the module 10 in the form of 
MMS messages destined to users such as the users 12,13 and 14 

30 indicated in FIG. 1. 
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In particular, the module 10 can be configured in such 
a way as to allow the transmission of a determined message MMS to 
multiple recipients or to a list of recipients . 

Consequently, though hereinafter reference shall be 
s made nearly exclusively to the generation, from an SMS message, 
of an MMS message sent to a single recipient, the solution 
according to the invention is easily suited to allow the MMS 
message in question to be broadcast to a list of recipients 
defined for instance by means of an http request or by means of 
10 an ftp request sent to the module 10. 

As stated previously, the core of the module 16 is 
constituted by the system for the creation of multimedia content 
represented by virtual characters animated by text or natural 
voice. An example of such a system is the JoeXpress® system, 
is mentioned above. 

Such a system enables a user to select a virtual 
character, its background, any personalizations, the format in 
which the content is to be produced. The selected parameters are 
used to produce animations with the desired context and format. 
20 The flowchart of FIG. 2 shows the steps of the process 

whereby a system according to the invention is accessed by a 
user, indicated as 18 in FIG. 1, who acts as a "sender." The 
user 18 has a terminal able to send SMS messages to a 
corresponding center able to handle this type of messages, such 
25 as the center indicated as 17 in FIG. 1. 

Starting from an initial step, indicated as 200, the 
reference 202 indicates the step in which the user 18 composes on 
his/her terminal an SMS message (with the characteristics better 
illustrated hereafter) sending it to a telephone number 
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associated with the service which forwards the SMS message after 
providing it with MMS characteristics . 

The service in question is implemented mainly by the 
module indicated as 16, but some functions can be performed by 
s the module 10 and, possibly, by the module 17. 

In the step indicated as 204 in FIG. 2, the service 
management function-hence essentially the module 16- generates 
the request for the emission of an MMS message corresponding to 
the received SMS message. As will be explained better hereafter, 

10 such a request contains, in addition to the message itself, also 
the user's identifier and (possibly) information pertaining to 
the type of recipient terminal. 

In the step indicated as 206, the module 16 processes 
the request received, generating an MMS message adapted to the 

is graphic and processing capacity characteristics of the recipient 
terminal. In the step indicated as 208, the MMS message is sent 
to a corresponding MMS center (such as the module 10) which, in a 
subsequent step 208, forwards the message to the recipient 
terminal, such as the terminal 12,13 or 14. 

20 The step 210 indicates the step in which the message is 

presented to the recipient terminal according to the typical 
modes of presentation of an MMS. Once the transmission is 
completed with the reading of the MMS message, the system moves 
to a conclusive step, indicated as 212. 

25 The telephone number associated with the service, 

destined to be dialed by the user 18 in the step 202 is 
preferably a dedicated telephone number of the kind usually 
called "large account." 

The sequence of characters sent by the user contains, 

30 in addition to the text of the message, also some information in 
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the header such as the telephone number of the recipient of the 
MMS message (users 12,13, 14 of the diagram of FIG. 1), the 
virtual character that will reproduce the message and the 
background into which it will be inserted, 
s The last two information items are optional and can 

therefore be omitted. In case of omission, corresponding 
information are selected automatically by the module 16, for 
instance as a random choice or as a predefined choice (default) . 
Naturally, this can be applied even for only part of the 
10 information: for instance, if only the character is specified, 
the module 16 automatically selects the background. 

The sequence of characters sent to the service 
therefore usually has the following form: 

<recipient telephone number> [<virtual character 
is [<background>] ] 

<text message> 

In the step 202 the header of the message can be 
composed either manually or by means of a script residing on the 
terminal 18 which allows to select the virtual character and the 
20 background by means of a menu and the recipient from the address 
book. 

If the message is dialed manually, the sequence of 
characters can contain errors . For example , the user could 
specify the name of a non-existing virtual character or 
25 background. In this case, the service replaces the faulty 
information by automatically selecting correct options . 

It will be appreciated that the script functions 
correspond essentially to functions provided in some mobile 
telephony terminals for sending SMS messages, with the 
30 possibility to load the related software remotely in the 
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individual terminal 18 (in particular in the Subscriber Identity 
Module or SIM of the terminal) by the same service management 
system. 

The module for transforming the SMS text format into 
5 MMS multimedia format, preferably based on the JoeXpress® systems 
already mentioned several times above, is preferably used in the 
mode called "text animation." 

In this case, the text of the SMS message is processed 
by a voice synthesizer which transforms the text into voice and 
10 provides the timed phonetic sequence, which is then used for the 
automatic generation of the speech movements of the selected 
virtual character. The text provided as an input to the SMS /MMS 
conversion module may contain meta- information that have an 
influence over the resulting animation, adding expressions and 
is gestures to the virtual characters and altering the synthetic 
voice . 

The meta- information is inserted in the text as sequences of 
characters that can have, for instance, the following form: 

<tagXaction_type>[<parl>] [<par2>] . . . [<parn>] 
20 where : 

<tag> is necessary to distinguish the meta- information 
from the text to be synthesized 

<action-type> specifies which action is to be executed. 
Examples of actions are: change in voice timbre, reproduction of 
25 a facial expression or of a body movement, change in viewpoint, 
etc. 

<parl-n> is the parameter that modifies the action, for 
instance the alteration of the duration of a facial expression. 
An alternative representation at higher level is 
30 constituted by the so-called "emoticons," i.e. by sequences of 
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characters commonly used in Internet in text communications , 
which represent emotional states. Examples of emoticons are : " 
;-)", " :-)", " :-0", etc. 

Emoticons are transformed by the system into a 
s semantically equivalent form using the representation described 
above. Support for the emoticons is motivated by the fact that 
they are familiar to users and simple to insert in the text, 
while having the same flexibility as low-level representation. 

A system like the JoeXpress® system produces animations 
10 of three-dimensional models that can be translated by the system 
into different formats, classifiable in two categories depending 
on whether the three-dimensional information is retained or not. 

To the first category belong, for instance, the 
sequences of MPEG- 4 Face and Body Animation parameters, VRML 
is animations (acronym for Virtual Reality Modeling Language) , 3D 
Studio Max animations etc. 

To the second category belong the video coding formats 
like MPEG-1, MPEG-2, MPEG-4 video, animated GIF (while it is not 
a video coding format in the strict sense of the term, the 
20 GIF- 8 9a format does allow to create image sequences) . 

The audio of the animation can be encoded together with 
the video or separately as in the case of VRML or animated GIF. 

Due to the limits in the terminals of the transmission 
network, multimedia contents are subject to constraints such as 
25 the maximum size of the message, spatial resolution, time 
resolution, and the type of coding of the animation. 

For this reason, in addition to the text of the message 
and to the identifier of the sender, it is necessary to take into 
account the type of terminal whereto the multimedia message is to 
30 be transferred. 
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The terminal type essentially identifies the class of 
the terminal (in essence, characteristics such as storage <BR> 
<BR> capacity, display size, etc.) and any other constraints due 
to the transmission network, 
s The MMS message destined to be produced in a system 

according to the invention is therefore conditioned to exploit 
the available resources most efficiently, within the imposed 
constraints . 

This requirement can be met in at least two different 

10 ways . 

A first way provides for the request to create the MMS 
message, generated at step 204, to contain, in addition to the 
text of the message and the sender's identifier, also information 
indicating the class whereto the message to be generated must 

is belong, i.e. the type of terminal whereto the MMS message is 

destined and hence its performance characteristics. The video 
content destined to integrate the SMS textual message is then 
generated according to the recipient terminal type, i.e. in such 
a way as to cause the MMS message (derived from the multimedia 

20 message obtained by integrating the video content and the SMS 
message) to be directly compatible with the characteristics of 
the MMS terminal destined to receive the multimedia message. 

When this solution is adopted, the module 16 is able to 
search, based on the recipient's identifier, the terminal type 

25 information stored in the database 11. The connection between 

the module 16 and the database 11 can be either of the direct or 
of the indirect type, through the module 10, according to the 
criteria whereto FIG. 1 refers. 

A second way to obtain the same result provides for the 

30 multimedia video content (destined to be added to the SMS 
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message) to be generated by the module 16 on the basis of 
criteria that are standard, hence independent from the type of 
terminal whereto the message is destined to be transmitted. 

The multimedia message deriving from the integration 
s between the SMS textual message and the standard multimedia video 
content is forwarded by the module 16 to the module 10 which, 
reading the information about the recipient terminal from the 
database 11, "specializes" the MMS message derived from the 
multimedia message, adapting it to the characteristics of the 
10 recipient terminal . 

The choice to adopt one or the other solution is 
primarily dictated by application considerations . 

The first solution has, at least in principle, the 
advantage of not entailing the generation of information destined 
is to be discarded when the message is adapted to the requirements 
of the recipient terminal. However, this advantage is offset by 
the need to ensure that the module 16 is able to receive the 
information about the type of terminal, residing in the database 
11. 

20 The second solution has the advantage that it exploits 

the availability of the information of the database 11 at the 
level of the module 10, already normally provided for current MMS 
applications. In current MMS applications, the module 10 is 
already capable of achieving a specialization of the forwarded 

25 MMS messages according to the characteristics of the recipient 

terminal. The advantages indicated above, however, are at least 
marginally tempered by the fact that this solution entails the 
generation, by the module 16, of information destined to be 
discarded. 
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Whichever solution is adopted, it is possible to 
benefit from the fact that the same animation can be represented 
in an MMS message in substantially different manners. 

For instance, one can make use, as stated previously, 
s of an animated GIF image with a low number of frames per second, 
in which case each frame shows the text of the message pronounced 
at that instant by the character. This particularly compact 
representation is well suited for situations in which the message 
size constraints are particularly stringent, or when the 

10 recipient terminal is not able to show a video. 

Alternatively, one can employ an animated GIF image, 
with compressed audio. In this case, the synthesized voice, 
possibly complete with scene audio, is also included in the 
message. This is a useful representation for terminals that do 

is not support video but are able to handle audio, when the size of 
the message is sufficiently large to contain both the moving 
image and the audio track. 

An additional alternative is represented by a video 
clip complete with audio. In this case, an animation is obtained 

20 that can be more fluid in its motions thanks to the higher 

compression ratio offered by a video coding with respect to an 
animated GIF image and to the higher number of frames 
consequently used in the animation. This solution can be adopted 
with terminals that are able to support video coding. 

25 It should be stressed that the ways to package the 

message recalled above are mere examples, and they are far from 
being exhaustive of the possibilities offered by the solution 
according to the invention. 
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The description will now be provided, with reference to 
FIGS. 3A and 3B, of a possible architectural arrangement of the 
module indicated as 16 in Figure 1. 

The block or module 300 is destined to receive as its 
s input the SMS message substantially as transmitted by the 

terminal 18 and to perform thereon the operation of extracting 
the information from the header. 

As previously seen, the first part of the text is 
represented by a header containing the number of the recipient 
10 terminal (for instance, with reference to the diagram of Figure 
1, the terminal 12, the terminal 13 or the terminal 14) and, 
optionally, the indication of the character and of the background 
which the sender user wants to use to generate the video content. 
These data are divided from the actual message by a separator 
is character . 

The message can contain low or high-level 
me ta- information (for instance the so-called emoticons) which 
influence the resulting animation. 

As an example of such text, one can consider the 

20 string: 

"3356121180 Morpheus Country@Hi ! I'm at the beach:-) 
but I'm getting bored without you. \kyawn, 150." 

In the example, the separator used is the character - 
Associated with the message in question are the identifier of the 
25 sender as well as, possibly, the string indicating the 
recipient ' s terminal model . 

The reference 302 indicates the database of the module 
16 which, in the preferred implementation based on the JoeXpress® 
system, contains information such as the list of characters 
30 usable for generating the video content, the languages associated 
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with them, the available scenarios, etc. The database 302 also 
contains the three-dimensional models of the characters and of 
the backgrounds . 

Co-operating with the database 302, the block 300 
s extracts from the message header information such as the 
recipient's identifier, as well as the character and the 
background to be used to create the video content. 

The block 300 then communicates with the database 302 
that contains the character list, voices, available backgrounds 
10 and, if these information are omitted or erroneous in the header 
of the received SMS message, the block 300 automatically selects 
correct options . 

The block 300 generates at its outputs the following 
data/information : 

is the text of the message without the header ("Hi ! I'm 

at the beach :-) but I'm getting bored without you. \ kyawn, 
150") destined to be sent to an additional block 302 whose 
function shall become more readily apparent hereafter; 

the name of the character P, protagonist of the 

20 animation (in the example illustrated herein, the name is 
"Morpheus") , 

the language L associated with the character (for 
instance, English) , 

the background A corresponding to the scenario in which 
25 the virtual character P is to be placed (in the example 

considered herein, the background is a "country" background) , and 

the identifier of the recipient D (constituted, in the 
illustrated example, by the number 3356121180) . 

Starting from the text of the message M received from 
30 the block 300, the block 302 transforms the emoticons into 
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me ta- information capable of being used by the information system 
that simultaneously determines what text will be inserted in the 
frames constituting the animation of the MMS message constituting 
the output of the module 16. 
s Therefore, the output of the block 302 is constituted 

both by a text TBS with low-level information, i.e. a text in 
which emoticons are replaced with low-level meta- information 
(""Hi! I'm at the beach \ksmile but I'm getting bored without 
you. \ kyawn, 150") , and a text TE in which all low-level 

10 information has been eliminated, retaining only what will be the 
by the character plus the emoticons ("Hi ! I'm at the beach :-) 
but I'm getting bored without you.") . 

The text TBS generated by the block 302 is sent to a 
block 304 destined to extract the list of actions contained in 

is the text and to prepare the text in the form used by a voice 

synthesizer 306 in such a way as to obtain also the timing to be 
associated with the these actions. 

The block 304 transmits to the synthesizer 306 a text 
TAG in which the low-level meta- information are replaced with 

20 "tags" of the voice synthesizer (text- to- speech) . These tags are 
sequences of characters identified by the synthesizer as special 
information and used either to alter the synthesized voice or to 
obtain from the synthesizer 306 the time instants associated with 
the tags in the synthesized sentence, the time instants are used 

25 to determine the timing of the actions . 

The block 304 also generates as an additional output a 
signal TA substantially corresponding to a list of the actions 
contained in the text, complete with any parameters. 

Referring to the SMS message mentioned several times 

30 above, there are essentially two actions contained, i.e.: 
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smile , and 
yawn, 150. 

The parameter 150 modifies the duration of the "yawn" 
action with respect to a standard duration, 
s The voice synthesizer 306 transforms into a voice 

signal the text TAG received from the block 304 using the 
selected language identified by the signal L generated by the 
block 300. 

In addition to the voice signal, the block 306 also 

10 produces the timed phonetic sequence FT, used as the basis of the 
construction of the movement of the spoken word. It should be 
recalled that the timed phonetic sequence is the sequence of 
phonemes constituting the spoken sentence, integrated with the 
time instances whereat the phonemes are spoken. 

is The signal indicated as V is, instead, the actual 

synthesized voice signal. 

The blocks indicated with the references 308 and 310 
are engines that supervise the animation of the spoken word and 
the corresponding facial and body animation of the character used 

20 for the video content. 

The block 308 receives as an input the phonetic 
sequence FT transforming it into a "visemic" sequence, i.e. into 
the movement produced by the face as it speaks . 

To obtain a realistic movement, the animation engine 

25 considers the mutual influence effect of adjacent phonemes, the 
co-articulation phenomenon. The movement produced is 
three-dimensional and the related output signal AP is constituted 
by animation parameters that describe the movement of the spoken 
word in three- dimensional fashion and independently from the 

30 character . 
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This means that such parameters are successively 
applicable to characters with any shape and complexity, human and 
otherwise . 

The block 310, serving as facial and body animation 
s engine operates on the basis of the list of actions corresponding 
to the signal TA generated by the block 304 integrated in a 
virtual summation node 312 with the information on the timing of 
the actions, generated by the synthesizer 306. 

The block 310 operates in co-ordinated fashion with an 

10 additional database 314 which contains sequences of facial and 
body movements in the form of animation parameters independent 
from the character, thus similar in this regard to the parameters 
output by the block 308. In the example, the sequences "smile" 
and "yawn" are two movements drawn from the database 314 . 

is The facial and body 310 animation block unites the 

individual actions corresponding to the various movements that 
the character will have to perform, creating a single sequence of 
animation parameters . The individual movements are altered based 
on any parameters associated therewith. 

20 The movements also undergo automatic variations in 

intensity, duration, specular characteristics, etc. to enhance 
variety. Lastly, some movements executed by the characters but 
not explicitly indicated, such as blinking eyelids, are also 
added . 

25 The output of the block 310 is constituted by a signal 

AFC representative of animation parameters that describe the 
movement of the spoken word in three-dimensional fashion, 
independently from the character, the parameters are, therefore, 
successively applicable to characters with any shape and 

30 complexity, human and otherwise, such as animals. 
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A successive block indicated as 316 has the task of 
mixing the movements of the spoken word (signal AP) with the 
other movements (signal AFC) to obtain a realistic result. The 
operation of the block 316 is based on a logic that takes into 
s account the priorities of movements that may be contrasting, such 
as speaking a plosive phoneme (such as the letter "p") and 
yawning. The resulting movement is three-dimensional. 

The output signal of the block 316 is constituted by a 
signal AIP representative of an animation independent from the 
10 character . 

The signal AIP is fed to a block 318 that transforms 
the independent animation (signal AIP) into the movement of the 
character selected on the basis of the signal P extracted from 
the block 300. The resulting movement is dependent on the 
is topology of the model. The model associated with the character 
is, as seen previously, contained in the database 302. 

The output signal of the block 318 is constituted by a 
signal ADP identifying the sequence of movements of the selected 
character . 

20 The signal ADP in question is fed to a block 320 that 

merges the signal ADP with the background information A that 
comes from the block 300 with additional information on the 
characters and on the backgrounds drawn directly from the 
database 302 . 

25 All this in order to add to the animation of the 

character also the remaining animations which may be present in 
the scene (signal A) and can be driven by means of the 
me ta- information in the text, as movement of objects or change of 
the viewpoint of the shot. 
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The output signal of the block 320 is constituted by a 
final three-dimensional animation signal TRD destined to be sent 
to a block 322 tasked with the rendering operation, i.e. with the 
operation of representing on a screen, as a pixel matrix, the 
s three-dimensional scene constituted by the character and by the 
background. The sequence of the pixel matrix, obtained at 
regular time intervals, constitutes the output of the block. The 
output of the rendering block 322 is constituted by a sequence of 
video frames of the animation indicated as FV. 
10 The sampling rate of the video frames is a parameter 

that is typically set in preferred fashion to 25 Hz. 

The signal FV is fed as an input to an additional block 
324 destined to receive also the text with emoticons TE generated 
by the block 302. 

is The block 324 distributes the text among the various 

frames constituting the video animation produced, the operation 
is optional and is performed when an MMS message without audio is 
to be generated, i.e. an MMS message in which the SMS message is 
shown in the form of text and animation. 

20 The output of the block 324 is constituted by the set 

of all movements of the character and of the scene, the signal 
FVT, corresponding in practice to the sequence of the video 
frames with the text, is fed to a video coding block 326 destined 
to receive as its input, in addition to the signal FVT, also the 

25 signal V pertaining to the synthesized voice as well as the 
information TV pertaining to the type of terminal of the 
recipient. 

The embodiment shown in FIGS. 3A and 3B refers to a 
solution in which the information is made available at the level 
30 of the module 16. the information generally indicates brand and 
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model name of the recipient terminal (for example, Sony Ericsson 
T68i, Nokia 7650, etc.) . 

The block 326 proceeds in this case by creating the 
video clip directly in a format suitable to be viewed from the 
s recipient terminal in question. The adaptation of the video clip 
to a determined type of terminal can influence, for example, on 
the spatial and time resolution of the frames , on whether the 
audio channel is inserted or not, etc. 

The solution whereto reference is made herein therefore 
10 provides for integrating the SMS message with a video content 

generated in this way so that the resulting multimedia message, 
generated by the module 16, is in a format suitable for being 
viewed from the terminal. 

As stated previously, the solution according to the 
is invention can, however, also be implemented in conditions in 
which the module 16 (and, therefore, the block 326, in the 
embodiment illustrated herein) does not carry out any 
"specialization" action of this kind. 

In this case, the video clip, or in general the video 
20 content destined to complement the incoming SMS text message, is 
generated in a standard format, i.e. without taking into account 
the characteristics of the recipient terminal. 

The related format conversion, destined to make the 
final MMS message actually viewable by the recipient terminal, is 
25 then left to the module 10 (Figure 1) with MMS relay/server 
functions . 

In the embodiment illustrated herein (which is in fact 
an example) the output signal from the block 326 is then 
constituted by a signal VC essentially similar to a video clip in 
30 compressed format, signal is transmitted to a block 328 destined 
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to construct, starting from the multimedia message carried at its 
input, a message corresponding to the MMS standard. 

To proceed in this way, the block 328 receives at its 
input, in addition to the signal VC output by the block 326, also 
s the signal TE corresponding to the text with emoticon generated 

by the block 302 , the signal pertaining to the recipient D coming 
from the block 300, as well as the information about the sender 
S: the latter information is derived from the center 17 of Figure 
1 according to known criteria, requiring no detailed description 
10 herein. 

To generate the MMS message, destined to be sent to the 
module 10, the block 328 inserts the video animation previously 
computed in an MMS message. This preferably takes place using 
the SMIL language of description of the scene and joining various 

is multimedia objects in a single form comprising multiple parts. 

The block 328 also inserts in the message header the 
information about the sender, recipient and subject. The subject 
is constructed automatically using the first characters 
constituting the text with emoticons . 

20 Preferably, the block 328 is also destined to co- 

operate with an additional database 330 constituted by a 
collection of images to be inserted in the MMS message as 
"logos" or advertising, or as sounds able to be used as 
background music for the scene or as advertising jingles. 

25 Naturally, without changing the principle of the 

invention, the details of its implementation and the embodiments 
may be amply varied with respect to what is described and 
illustrated herein purely by way of example, without thereby 
departing from the scope of the present invention. This holds 

30 true in particular, but not exclusively, for the possibility of 
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applying the invention to convert into MMS messages text messages 
generated other than by an SMS, for instance in the form of 
e-mail messages, and to the possibility of applying the invention 
to the transmission of MMS messages on other than UMTS networks. 



- 25 - 



23203AP2.wpd 



